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METHOD FOR AUTOMATED SYSTEM IDENTIFICATION 



FIELD OF THE INVENTION 

The present invention relates generally to system identification and, 
5 more particularly, to a method for automated system identification of dynamic 
systems. 

BACKGROUND OF THE INVENTION 

The design and manufacture of new products has become an 
increasingly complex activity due to the high performance required by the 

10 users of such products. Therefore, many products are designed to incorporate 
high performance signal processing and/or control schemes. The methods 
used in designing high performance signal processing and /or control schemes 
require mathematical models of the systems under consideration. Control and 
signal processing engineers construct these mathematical models using 

15 established modeling methods. 

One family of methods for constructing models is known as system 
identification. In order to use system identification methods, system designers 
must rely on data gathered from experiments conducted on the system under 
consideration, as well as on prior knowledge of the behavior of the system. 

20 Most system identification methods are iterative and seek to improve model 
accuracy through repeated experiments and numerical computations. The 
resulting accurate models can be used to design high performance signal 
processing and /or control schemes for the systems under consideration. 

An example where system identification is used is in communication 

25 systems. A key component of any analog or digital communication system is a 
communications channel. The communications channel is the medium through 
which a signal is transmitted and received. For a Digital Subscriber Line (DSL), 
the channel may include the analog transmitter electronics, the copper wiring 
that connects the central office and the customer modem, and the analog 

30 receiver electronics. For a wireless communications system, the channel may 
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include the analog transmitter electronics, transmitting antenna, 
electromagnetic propagation to the receiving antenna, the receiving antenna 
itself, and analog receiver electronics. Accurate channel models play a critical 
role in analyzing and designing communications systems. 
5 Several system identification methods are available to system designers. 

Many of these methods are encoded in existing system identification software 
tools. A typical example of a system identification tool is MATLAB® System 
Identification Toolbox, available from Mathworks, Inc., Natick, Massachusetts, 
which is a state-of-the-art system identification package having a graphical user 
10 interface (GUI). To successfully use the MATLAB® System Identification 
Toolbox, the control system designer must interpret results and make 
numerous complex decisions in the areas of identification experiment design 
D and refinement, experimental data quality analysis, and model quality analysis. 

i: '£ 

■CIS" 

m These system identification tools thus require experienced users with specific 

15 expertise in system identification theory. In addition, these methods leave the 
construction and refinement of system identification experiments up to the 
user. 

Therefore, what is needed is a tool for the identification of dynamic 
systems that automates the entire identification process. Further, the tool 
20 should require little or no specific knowledge of system identification theory 
from the control system designer who uses it. 



Li 



3 



SUMMARY OF THE INVENTION 

A method for automated system identification is disclosed. A model 
structure is selected and reference signal values are generated for input into the 
system. Input signal values and output signal values are retrieved from the 
5 system and system identification is performed on the model structure using the 
input signal values, the output signal values, and the reference signal values. A 
point model, obtained as a result of the system identification, is then verified for 
accuracy. 

Other features and advantages of the present invention will be apparent 
10 from the accompanying drawings, and from the detailed description, which 
follow below. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example and not limitation 
in the figures of the accompanying drawings, in which like references indicate 
similar elements and in which: 

Figure 1 shows an exemplary closed loop data acquisition system 
architecture. 

Figure 2 shows an exemplary open loop data acquisition system 
architecture. 

Figure 3 is a flowchart representing the method for automated system 
identification. 

Figure 4 is a flowchart representing one embodiment of the process of 
preparation for point modeling. 

Figure 5 is a flowchart representing one embodiment of the process of 
generation of reference trajectories and data collection. 

Figure 6 is a flowchart representing one embodiment of the process of 
detection and removal of outliers. 

Figure 7 is a flowchart representing one embodiment of the process of 
verification of the model accuracy. 
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DETAILED DESCRIPTION 

A method for automated system identification is described. In the 
following description, for purposes of explanation, numerous specific details 
are set forth in order to provide a thorough understanding of the present 
5 invention. It will be evident, however, to one skilled in the art that the present 
invention may be practiced without these specific details. 

In some instances, well-known structures and devices are shown in block 
diagram form, rather than in detail, in order to avoid obscuring the present 
invention. These embodiments are described in sufficient detail to enable those 
10 skilled in the art to practice the invention, and it is to be understood that other 
embodiments may be utilized and that logical, mechanical, electrical and other 
changes may be made without departing from the scope of the present 
invention. 

Some portions of the detailed descriptions that follow are presented in 
15 terms of algorithms and symbolic representations of operations on data bits 
within a computer memory. These algorithmic descriptions and 
representations are the means used by those skilled in the data processing arts 



J- J to most effectively convey the substance of their work to others skilled in the 

art. An algorithm is here, and generally, conceived to be a self-consistent 



20 sequence of acts leading to a desired result. The acts are those requiring 

physical manipulations of physical quantities. Usually, though not necessarily, 
these quantities take the form of electrical or magnetic signals capable of being 
stored, transferred, combined, compared, and otherwise manipulated. It has 
proven convenient at times, principally for reasons of common usage, to refer 

25 to these signals as bits, values, elements, symbols, characters, terms, numbers, 
or the like. 

It should be borne in mind, however, that all of these and similar terms 
are to be associated with the appropriate physical quantities and are merely 
convenient labels applied to these quantities. Unless specifically stated 
30 otherwise as apparent from the following discussion, it is appreciated that 
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throughout the description, discussions utilizing terms such as "processing" or 
"computing" or "calculating" or "determining" or "displaying" or the like, refer 
to the action and processes of a computer system, or similar electronic 
computing device, that manipulates and transforms data represented as 
5 physical (electronic) quantities within the computer system's registers and 

memories into other data similarly represented as physical quantities within the 
computer system memories or registers or other such information storage, 
transmission or display devices. 

The present invention can be implemented by an apparatus for 
10 performing the operations herein. This apparatus may be specially constructed 
for the required purposes, or it may comprise a general purpose computer, 
selectively activated or reconfigured by a computer program stored in the 

O 

al computer. Such a computer program may be stored in a computer readable 

U1 

pjj storage medium, such as, but not limited to, any type of disk including floppy 

Jff 15 disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories 

IssJ 

01 (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or 

Ul 

a optical cards, or any type of media suitable for storing electronic instructions, 

" and each coupled to a computer system bus. 

^ The algorithms and displays presented herein are not inherently related 

□ 20 to any particular computer or other apparatus. Various general purpose 

systems may be used with programs in accordance with the teachings herein, or 
it may prove convenient to construct more specialized apparatus to perform the 
required method. For example, any of the methods according to the present 
invention can be implemented in hard-wired circuitry, by programming a 
25 general purpose processor or by any combination of hardware and software. 
One of skill in the art will immediately appreciate that the invention can be 
practiced with computer system configurations other than those described 
below, including hand-held devices, multiprocessor systems, microprocessor- 
based or programmable consumer electronics, network PCs, minicomputers, 
30 mainframe computers, and the like. The invention can also be practiced in 
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distributed computing environments where tasks are performed by remote 
processing devices that are linked through a communications network. The 
required structure for a variety of these systems will appear from the 
description below. 

5 The methods of the invention may be implemented using computer 

software. If written in a programming language conforming to a recognized 
standard, sequences of instructions designed to implement the methods can be 
compiled for execution on a variety of hardware platforms and for interface to a 
variety of operating systems. In addition, the present invention is not described 
10 with reference to any particular programming language. It will be appreciated 
that a variety of programming languages may be used to implement the 
teachings of the invention as described herein. Furthermore, it is common in 
the art to speak of software, in one form or another (e.g., program, procedure, 
application...), as taking an action or causing a result. Such expressions are 
15 merely a shorthand way of saying that execution of the software by a computer 
causes the processor of the computer to perform an action or produce a result. 

Figure 1 shows an exemplary closed loop data acquisition system 
architecture 100. In one embodiment, the present invention is described in 
connection with closed loop system identification. Alternatively, the present 
20 invention may be implemented with open loop system identification. 

However, closed loop system identification should be used when the system to 
be identified is unstable. 

Referring to Figure 1, a system 110 is instrumented to provide injection 
of a reference signal r and automatic collection of input signals u and output 
25 signals y. In one embodiment, the reference signal r, also known as excitation 
signal or reference trajectory, is injected at the input of system 110. 
Alternatively, the reference signal r may be injected at the output of system 110. 
The distinction between reference signal r and input signal u is provided for 
closed loop identification purposes. In one embodiment, the reference signal r 
30 resides in a table or data file 150 and is injected using a computer interface 130. 
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Conversely, input signals u and output signals y are collected and stored in a 
separate table or data file 140 using the same computer interface 130. In this 
embodiment, the controller 120 is provided to control the system 110. 
Figure 2 shows an exemplary open loop data acquisition system 
5 architecture 200. Referring to Figure 2, a system 210 is instrumented to provide 
injection of a reference signal r and automatic collection of input signals u and 
output signals y. In one embodiment, the reference signal r is injected at the 
input of system 210. In contrast to closed loop identification, in the case of open 
loop identification, the reference signal r and the input signal u are identical. In 
10 one embodiment, the reference signal r resides in a table or data file 250 and is 
injected using a computer interface 230. Conversely, input signals u and output 
signals y are collected and stored in a separate table or data file 240 using the 
same computer interface 230. 

Figure 3 is a flowchart representing one embodiment of the method for 
i ^ 15 automated system identification of linear systems. According to one 
y embodiment, a system user or designer specifies a list of operating conditions. 

Ul For example, in the case of identification of a disk drive, the user may specify a 

ri temperature, a vibration condition, a track number, a surface number, and a 

£j n drive number. The user may also provide information about the sampling 

CI 20 frequency and the desired frequency bandwidth covered by a prospective 
Q model. 

Referring to Figure 3, at step 310, a list of point models to be identified is 
constructed from the list of operating conditions provided by the user. At step 
320, during preparation for point modeling, each model is set up as described in 
25 detail below and as illustrated in Figure 4. 

Subsequent to the preparation for point modeling, a decision is made at 
step 330 concerning the qualification of the system. If the system is qualified, 
based on system qualification tests described below in connection with Figure 4, 
then reference trajectories are generated and data is collected at step 340. If the 
30 system is not qualified, then the procedure is terminated. 
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At step 340, reference signals or trajectories are generated and data is 
collected as described in further detail below in connection with Figure 5. 

At step 350, outliers are detected and subsequently cleaned or removed. 
One embodiment of the process of detection and removal of outliers is 
described in further detail below in connection with Figure 6. 

Next, a decision is made at step 360 whether a sufficient amount of clean 
data is available for further processing. If the removal of outliers was 
successful, but the number of removed values exceeds a predetermined 
percentage of the number of stored output signal values, data is discarded and 
steps 340 and 350 are repeated. Once sufficient clean data is available, point 
model identification is performed at step 370. One embodiment of the point 
model identification procedure will be described in further detail below. 

At step 380, uncertainty coefficients are computed, wherein the 
coefficients correspond to transfer functions identified during the point model 
identification performed at step 370. 

At step 390, a decision is made whether the model obtained is accurate. 
In one embodiment, a model qualification procedure is performed. One 
embodiment of the model qualification procedure will be described in further 
detail below in connection with Figure 7. 

Finally, a refinement of the identification experiment is performed at 
step 395. Once the model is assessed as accurate at step 390, the entire process 
is repeated for each point model in the point model list. 

In alternate embodiments, the order of the steps to be performed may be 
modified without departing from the scope of the present invention. Similarly, 
it is to be understood by a person of ordinary skill in the art that some steps 
may be eliminated or made available as optional features. 

In one embodiment, the present invention is implemented in connection 
with a supervisory decision entity, which is periodically called to make 
complex decisions and interpret results of different steps of the method shown 
in the flowchart of Figure 3. In one embodiment, the decision entity is a Real- 
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Time Planner (not shown). The Real-Time Planner is described in detail in U.S. 
Patent Application Serial No. 09/345,172, filed June 30, 1999, entitled Real-Time 
Planner for Design, to Sunil C. Shah, Pradeep Pandey, Thorkell Gudmundsson, 
and Mark Erickson, and assigned to Voyan Technology Corporation of Santa 
5 Clara, California. Alternatively, another software application may be provided 
to make the required complex decisions. In another alternate embodiment, the 
user may interpret the results and make supervisory decisions. 

Preparation for Point Modeling 
The preparation for point modeling and model setup performed at step 
10 320 in Figure 3 will now be described in further detail. Figure 4 is a flowchart 
representing one embodiment of the process of preparation for point modeling. 
Referring to Figure 4, once the user specifies the list of operating conditions, 
system qualification tests are run at step 410. 
% I In one embodiment, a test is run to assess if the system satisfies the 

He? 

p 15 linearity assumptions. Particularly for the closed loop identification illustrated 
01 in Figure 1, the test may verify that the controller 120 is not introducing 

£J H 

~ " undesirable non-linearities. For example, a first reference signal r is initially 

injected. Subsequently, a second version of it, scaled by a predetermined factor, 

M is also injected. If the closed loop system 100 is linear, the resulting output 

O 

q 20 signals y must be scaled by the same factor. Alternatively, other qualification 

i - 

tests may be run, for example a quick reference trajectory improvement may be 
performed, and predicted and actual output signals y may be subsequently 
compared. In Figure 4, at step 420, a system qualification cost vector is 
computed as a result of the system qualification tests. The cost vector is further 

25 used to decide whether the system qualifies for application of the method. 

Generally, the present invention may be implemented with any linear 
model structure. A list of linear model structures is presented in L. Ljung, 
System Identification for the User, Prentice Hall, 1999. Bias of the model should be 
taken into consideration when selecting an appropriate model structure. Biased 

30 models may arise when the model structure is not sufficiently rich or when a 
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cost function associated with the model structure does not have a unique global 
minimum and, instead, presents several local minima. Model structures that 
may be used for identification include finite impulse response (FIR), 
autoregressive with external input (ARX), autoregressive moving average with 
external input (ARMAX), autoregressive moving average (ARMA), 
autoregressive autoregressive with external input (ARARX), autoregressive 
autoregressive moving average with external input (ARARMAX), output error 
(OE), Box-Jenkins (BJ), and Ordinary Differential Equations (ODE). 

As shown in the flowchart of Figure 4, at step 430 a cost vector is 
simultaneously computed for each available model structure. A model 
structure is subsequently selected at step 440 based on the computed cost 
vector. 

Each known model structure has an associated cost vector, which can be 
computed based on several cost variables. Among the cost variables to be 
considered in the process of computing the cost vector are: a risk of local 
minima factor (r.l.m.), which depends on the type of system or physical plant; a 
computational cost factor (c.c), which is a characteristic of the particular model 
structure; and an equipment time factor (e.t.), which is related to the number of 
identification experiments necessary to obtain accurate models. In one 
embodiment, the cost vector is passed to the real-time planner, which selects the 
appropriate model structure. Alternatively, the designer may select the model 
structure based on the calculated cost vector. 

Referring to Figure 4, once the model structure is selected at step 440, a 
model order is selected at step 450. Additional model parameters, such as an 
input signal delay or a disturbance model order, are selected at step 460. The 
model order selection is performed following the procedure described in detail 
in U.S. Patent Application Serial No. 09/345,640, filed June 30, 1999, entitled 
Model Error Bounds for Identification of Stochastic Models for Control Design, 
to Sunil C. Shah, and assigned to Voyan Technology Corporation of Santa 
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Clara, California. The selected model order may be revised according to the 
outcome of model qualification tests described in further detail below. 

At the same time, identification experiments are defined at step 470 and 
experimental parameters are obtained at step 480. In one embodiment, one or 
5 more operating conditions are specified for the model structure selected. At the 
same time, a sampling frequency and a frequency bandwidth covered by the 
model structure are also provided. The identification experiments are then 
defined based on the operating conditions, the sampling frequency, and the 
frequency bandwidth of the model structure. 
10 In one embodiment, the system identification of a point model can be 

performed subsequent to one or more identification experiments, depending on 
the number of input signals u and output signals y, as well as on the 

experimental setup available and the length of data required by the model 

Ul 

ni structure to obtain an accurate model. In the embodiment shown in Figure 1, 

15 table or data file 150 contains the reference signal r, also known as excitation 
8s signal or reference trajectory. Data collected during the identification 

a experiment, such as input signals u and output signals y, is stored in table or 

" data file 140. System input and output identifiers, as well as any other 

experiment-specific parameters, may be stored in a separate table or data file 
O 20 (not shown). Finally, a header file may store the plant input and output 
identifiers, as well as other experiment-specific parameters. 

A quiet run is defined as an identification experiment having a table or 
data file 150 containing a null reference signal r. Similarly, an excited run 
corresponds to an identification experiment having a non-zero excitation signal 
25 r. In one embodiment, an identification experiment corresponding to an excited 
run contains information on a predetermined frequency region only. As a 
result, the total frequency bandwidth, initially specified by the user, is 
separated into several excited runs according to the maximum duration of each 
identification experiment. 
30 Qualification of the System 
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Subsequent to the preparation for point modeling, a decision is made 
concerning the qualification of the system performed at step 330 in Figure 3. In 
one embodiment, the real-time planner decides whether the system is qualified 
or not. Alternatively, the decision may be made by another software 
5 application or by the system designer. If the system qualifies, then reference 
trajectories are generated and data is collected. If the system does not qualify, 
then the procedure is terminated. 

Generation of Reference Trajectories and Data Collection 
The generation of reference signals or trajectories and data collection, 

10 performed at step 340 in Figure 3, will now be described in further detail. 
Figure 5 is a flowchart representing one embodiment of the process of 
generation of reference trajectories and data collection. 

As illustrated in the flowchart of Figure 5, the model parameters and the 
experimental parameters, obtained at steps 450, 460 and 480 shown in Figure 4, 

15 are used to create reference signals r at step 510. In one embodiment, this step 
creates tables or data files 140 and 150 containing data corresponding to all 
identification experiments for a point model. In order to create a reference 
signal or trajectory r, a large output signal-to-noise ratio must be obtained and 
linear operation regime must be guaranteed for the point model. 

20 In one embodiment, chirp signals may be used as reference signals. In 

this embodiment, the envelope of an output signal is retrieved and divided by 
the envelope of a corresponding chirp signal in order to estimate an 
input/output gain. Subsequently, to obtain a new chirp signal envelope, a 
desired output level is divided by the calculated input/ output gain. The 

25 resulting chirp signal is conditioned to account for possible non-linear effects, 
such as system-input saturation and system-input slew-rate limits. 

Alternatively, other signals may be used as reference signals, such as 
pseudo-random binary sequence signals, a sum of sinusoids, or wavelets. The 
creation of reference trajectories is disclosed in further detail in U.S. Patent 

30 Application Serial No. 09/345,640, filed June 30, 1999, entitled Model Error 
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Bounds for Identification of Stochastic Models for Control Design, to Sunil C. 
Shah, and assigned to Voyan Technology Corporation of Santa Clara, 
California. The reference signals are stored in table or data file 150 at step 520. 
As shown in the flowchart of Figure 5, identification experiments are 
5 performed at step 530. The chirp signal is injected into the system 110 and, as a 
result, the input signals u and output signals y are collected and stored in table 
or data file 140 at step 540. The process of generation of reference trajectories 
and collection of data is iterative. At step 550, the procedure is repeated for 
each identification experiment and results are collected and stored in respective 
10 tables or data files. 

Detection and Removal of Outliers 
One embodiment of the detection and removal of outliers performed at 
step 350 in Figure 3 will now be described in further detail. Figure 6 is a 
flowchart representing one embodiment of the process of detection and 

15 removal of outliers. 

As a result of possible malfunctions in sensors or in the data acquisition 
system, experimental data may have outliers. Outliers are defined as 
input/output pairs of signals that do not correspond to normal operation of the 
system 110 of Figure 1. For example, with respect to a disk drive, errors in the 

20 gray code or in the servo burst readings may generate erroneous sensor 
readings in the position error signal (PES). 

Outliers are identified using a linear prediction or smoothing filter. The 
linear filter is based on a linear model of a low order. As shown in the 
flowchart of Figure 6, at step 610, the linear model is constructed and an error 

25 signal e(t) is computed as a difference between a predicted output signal value 
corresponding to the linear and the stored output signal y value obtained from 
the identification experiment. At step 620, a decision is made whether e(t) is 
greater than a predetermined threshold. In one embodiment, the real-time 
planner automatically decides whether e(t) is greater than the threshold. 

30 Alternatively, the decision may be made by another software application or by 
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the system designer. If e(t) is greater than the threshold, then the output signal 
y is declared an outlier, and its value at time t, namely y(t), is stored in an 
outlier list at step 630. The outlier detection process is iterative for various time 
values between zero and t max . If e(t) is lower than the threshold, then steps 620 
5 and 630 are repeated for a different time value. 

The content of the outlier list is verified at step 640. If the outlier list is 
empty, the data is considered clean at step 645. If the outlier list is not empty, 
outliers are cleaned at step 650. The cleaning procedure is used when outliers 
are rare. During cleaning, measured data is replaced at step 650 by the 
10 predicted output signal value corresponding to the same time value t. At step 
660, a decision is made whether the cleaning procedure was successful. In one 
embodiment, the decision is based on a new linear prediction or smoothing 
filter constructed from the clean data. If the cleaning was successful, data is 
S i considered clean at step 665. Otherwise, outliers are removed at step 670. 

15 The removal procedure typically occurs when outliers are grouped in 

clusters and replacement with predicted output values is not possible. At step 
670, data is removed and the identification experiment is split at the removal 
point. At step 680, a decision is made whether the removal procedure was 
successful. In one embodiment, the decision is based on a new linear prediction 
20 or smoothing filter constructed from the remaining clean data. If the removal 
was successful, data is considered clean at step 685. Otherwise, the entire set of 
measured data is discarded at step 690. 

Amount of Clean Data 
Subsequent to the detection and removal of outliers, at step 360 in Figure 
25 3, a decision is made whether a sufficient amount of clean data is available for 
further processing. In one embodiment, the real-time planner automatically 
decides whether the resulting clean data is sufficient for further processing. 
Alternatively, the decision may be made by another software application or by 
a system designer. If the removal was successful, but the number of removed 
30 values exceeds a predetermined percentage of the stored output signal values, 

16 
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the data is discarded. As a result, the generation of reference trajectories and 
data collection and the detection and removal of outliers are repeated using a 
new set of data. If sufficient clean data is available, point model identification is 
performed. 

Point Model Identification 

The point model identification procedure, shown at step 370 in Figure 3, 
will now be described in further detail. 

In one embodiment, two multi-input multi-output (MIMO) models 
describe each point model. These two MIMO models are an input/output 
model represented by a transfer function matrix G and a disturbance model 
represented by a transfer function matrix H. If the system is subject to periodic 
disturbances, a third MIMO model may be identified using fictitious sinusoidal 
input signals, as described in U.S. Patent Application Serial No. 09/345,166, 
filed June 30, 1999, entitled Adaptation to Unmeasured Variables, to Sunil C. 
Shah, and assigned to Voyan Technology Corporation of Santa Clara, 
California. 

In one embodiment, the input/ output model and the disturbance model 
are simultaneously identified from the clean data that results from step 360 in 
Figure 3. Alternatively, the disturbance transfer function that best explains the 
data is determined using a previously provided input/output model. In one 
embodiment, the user provides the input/output model. Alternatively, the 
input/ output model may be obtained from previous identification experiments. 
The identification of the disturbance model using a stable input/ output model 
is known and has been described in L. Ljung, System Identification for the User, 
Prentice Hall, 1999. However, this procedure requires the implementation of a 
filter that contains all the modes of the input/ output model. When this model 
is unstable, the filter cannot be implemented, because it will contain the 
unstable modes of the input/output model. A method which circumvents this 
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obstacle and enables the identification of disturbance models when the 
input/output model is unstable will be described in detail below. 

Let u(t) and y(t) be the input and output signals to a linear system M. 
Then, the linear system M is characterized by the following expression: 

M: y{t) = G{q)u{t)+H{q)e{t) 
where G and H are linear transfer functions, e(t) is a unit covariance white noise 
signal, t is the sample index, and q is a delay operator. Transfer function G 
relates to the input /output model, while transfer function H corresponds to the 
disturbance model. If G is unstable, then it can be factored as a product of two 
transfer functions: 

G(q) = G; x (q)G t (q) 

where G" 1 includes all the unstable modes of G, G s includes all the stable 
modes of G, and both G u and G s are stable transfer functions. 

Following standard system identification procedures, the prediction 
error associated with the above y(t) expression is computed as follows: 

e(t) = H- x [y(t)-G{q)u{t)} = H' l v 

Since G is known, then all components of signal v are known. However, 
because G is unstable, v cannot be computed using the above equation. 
Therefore, the two stable transfer functions G u and G s must be used in the 
computation as follows: 

e{t)=H- l \y(t)-G; x {q)G s {q)u{t)\ 
= H- l G; l [G u y{t)-G s {qHt)] = H- l v 

where H =G U H and v = G u y{t)-G s {q)u(t). 

The above equation represents a system with no control or external 
input. Typically, these systems are described by either an AR model structure 
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or an ARMA model structure depending on the parameterization chosen for 



H. 

Let p be the number of output signals and r the number of input signals. 

The signal v = G u y(t)-G s (q)u(t) is computed based on G u and G s . H is then 

obtained considering an AR model structure or an ARMA model structure. 
1. Using an AR model structure: A u ..., A n , are identified, where 

A i G^K PxP , such that 



v(t) = e(t)- A, v(f - 1)- • - A„ v(t -n)=e{t)-[A i ---A n ] 



v(t-l) 
v{t-n) 



The above equation is solved for matrices A,, i=l,. . .,n, using a known 
least squares (LS) algorithm. The resulting transfer function is 

H(q) = [l + A iq - l +- + A n q- n Y l 

2. ARMA model: Identify A, . . .,A n and B 0 . . .,B m , where A, 6 <R PxP 

and BjGSH^, such that 

v(0 = B 0 e(t)+- ■ ■ + B m e(t -m)-A l v(t - 1) A n v(t - n) 

The ARMA model is obtained after an iterative procedure that 
minimizes the following cost function 

where p is the number of outputs of the MIMO system, and 
$ = [ A, • • • A n B 0 • • B m ] . The resulting transfer function is 

H(q) = [i + A x q~ x + ■ ■ ■ + A n q- [b o + ■ ■ ■ + B m q- m J 
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Finally, the disturbance model is reconstructed as H - G~ l H . 
Uncertainty Computation 

Subsequent to point model identification, uncertainty bounds 
corresponding to each of the transfer functions G and H introduced above are 
computed. One embodiment of the computation of the uncertainty bounds has 
been described in detail in U.S. Patent Application Serial No. 09/345,640, filed 
June 30, 1999, entitled Model Error Bounds for Identification of Stochastic 
Models for Control Design, to Sunil C Shah, and assigned to Voyan 
Technology Corporation of Santa Clara, California. 

Model Qualification 

Subsequent to the uncertainty computation, a decision is made whether 
the model obtained is accurate. In one embodiment, a model qualification 
procedure is performed and the real-time planner evaluates the accuracy of the 
model. Alternatively, the decision may be made by another software 
application. In another alternate embodiment, a system designer may decide 
whether the model is accurate or not. One embodiment of the model 
qualification procedure will now be described in further detail. 

Figure 7 is a flowchart representing one embodiment of the model 
qualification procedure, i.e. a process of verification of the model accuracy. 
Referring to Figure 7, several tests are performed in order to qualify the model. 
In one embodiment, the real-time planner makes decisions related to the test 
results and the accuracy of the model. Alternatively, the system designer may 
decide whether the test results show that the model is accurate. 

An analysis of the statistical properties of innovation signals is 
performed at step 710. If the innovation signals are white stochastic signals, 
(uncorrected with past measurements), then the model is accurate. This model 
test is described in L. Ljung, System Identification for the User, Prentice Hall, 1999. 
At step 720, a decision is made whether the test performed at step 710 was 
successful. If the test was successful, quiet run data is analyzed at step 730. If 
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the test was not successful, then the identification experiment must be refined at 
step 395. 

The analysis of data collected during the quiet run is performed at step 
730. Frequency domain identification techniques are not consistent for closed 
5 loop identification. However, the accuracy of the model obtained can be 

qualified by comparing it with a transfer function estimate, computed as a ratio 
of system input and system output spectral estimates. Specifically, the spectral 
estimates obtained from quiet run are used to avoid confusion with the spectral 
modes injected by the reference signal in the excited run. The closed loop 
10 system is assumed to be excited by large disturbances. At step 740, a decision is 
made whether the model explains the quiet run data. If prominent features 
n observed in the spectral estimates obtained from the quiet run are present in the 

»f model, then the model is considered accurate and it is said that the model 

RJ explains the quiet run data. If not, the identification experiment must be 

jj s ii 

p 15 refined at step 395 and the model order has to be increased in order to account 
for system dynamics not captured by the model. 

If the model explains the quiet run data, then analysis of model error 
bounds is performed at step 750. Frequency regions corresponding to large 
model errors are detected and analyzed. If such frequency regions can be 
20 detected at step 760, then the identification experiment must be refined at step 
395 and the reference signal may be modified by increasing the sweep time 
corresponding to those frequency regions. If such frequency regions cannot be 
detected, then the identification is considered successful at step 770. 

Refinement of Identification Experiment 
25 Finally, subsequent to model qualification, the refinement of the 

identification experiment is performed at step 395 in Figure 3. In one 
embodiment, the main variables available for refinement are the model order, 
the total time length of experimental data, i.e. the number of identification 
experiments, and the excitation signal. In one embodiment, the cost needed by 
30 the real-time planner to make a decision is computed. For example, performing 
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new experiments has a high equipment time cost, while increasing the order of 
the model results in an increase in the total computational cost. The calculated 
costs are evaluated based on a priori estimates and data gathered in the steps 
described above. 

5 In one embodiment, once the model is assessed as accurate, the entire 

process is repeated for each available point model in the point model list. 
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